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ABSTRACT 

In a manner similar to that used in the orthogonal 
case, formulas for the asymptotic standard errors of analytically 
rotated oblique factor loading estimates are obtained. This is done 
by finding expressions for the partial derivatives of an oblique 
rotation algorithm and using previously derived results for unrotated 
loadings. These include the results of Lawley for maximum likelihood 
factor analysis and those of Girshick for principal components 
analysis. Details are given in cases including direct oblimin and 
direct Crawford-Ferguson rotation. Numerical results for an example 
invol'«jing maximum likelihood estimation with direct quartimin 
rotation are presented. They include simultaneous tests for 
significant loading estimates. . (Author) 
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STANDARD ERRORS FOR OBLIQUELY ROTATED FACTOR LOADINGS 



Abstract 

In a manner similar to that used in the orthogoiial case, fonnulas for 
the asymptotic standard errors of analytically rotated oblique factor 
loading estimates are obtained. This is done by finding expressior.s 
for the partial derivatives of an oblique rotation algorithm and using 
previously derived results for unrotated loadings. These include the 
results of Lawley for maximum likelihood factor analysis and those of 
Girshick for principal components analysis. Details are given in cases 
including direct oblimin and direct Crawford-Ferguson rotation. Numerical 
results for an example involving maximum likelihood estimation with direct 
quartimin rotation are presented. They include simultaneous tests for 
significant loading estimates. 
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!♦ Introduction 

In an earlier paper Archer and Jennrich [1975] derived formulas for 
the asymptotic standard errors of orthogonally rotated factor loading 
estimates* Corresponding results are derived here for obliquely rotated 
loadings • 

We begin with a p by k factor loading matrijc A = 
asymptotically normal estimate A = (^^^) • Majcimum likelihood estimates 
in the classical factor analysis model [Iflwley, I967] and the principal 
components estimates in principal components analysis [Girshick, 1959] 
aro both of this form. We are interested in the effect of an oblique 
rotation algorithm h on the asymptotic distribution of A . A function 
h is an oblique rotation algorithm if it maps an arbitrary p by k 
matrix X into a p by k matrix Y = XT where T is a nonsingular 
k by k matrix whose inverse has normalized rows, i*e*, 

diag(T»T)'-^ = I , 

the k by k identity matrix. The value of T may, and generally 
will, be a function of X . We are interested in the asymptotic dis- 
tribution of the rotated loading estimates A = h(S) = AT and the rotated 
factor correlation estimates (I) = (T*T)""^ . Of special interest are the 
cases when h represents oblimin [Harman, 19^7^ P* 52^] or Crawford- 
Ferguson [1970] rotation. 



2* Asymptotic Distributions iinder Oblique Rotation 



Let A = h(A) = (A. ) denote the "true" rotated loadings and O = 



g(A) = (T'T)""^ = ^^uv^ rotated factor correlations* Assume that 

at A , h has a differential dh and g has a differential dg . 
Then 

(1) >/ir (A - A) I dh( (A - A)) 

and 

(2) y/^ - dg( (A - A)) 

where " = " means that the difference between the left- and right -band 
sides of (l) and (2) approaches zero in probability as the sample size n 
on which the estimate A is based approaches infinity [Rao, 19^5^ P» 521 ]• 
Since dh and dg are linear and A is an asymptotically normal estimate 
of A , A and 0 are asymptotically normal estimates of A and 0 
whose asymptotic covariance matrices may be obtained from that of A • 
In terms of the partial derivatives of h and g , 

(5) acov(A. ) = Z T-r^ acov(a ,a ) 

^ ' ^ ir' js' mnxy oa ^ mx^ ny' oa 

mx ny 



and 



3s S^c 
/ \ \ ^ \ rs ^ \ uv 

[k) acov(0 ,0 ) = 2: ^ acovfcy .a ) k 

^ ' ^ rs^ uv' mnxy oa ^ mx^ ny' da 

mx ny 



Formulas for the asymptotic covariances on the right^ i»e«, the asymptotic 
covariances f or specific unrotated loadings, were given by Lawley [1967] for 
maximum likelihood factor analysis and by Girshick [1959] for principal 
components analysis. (See Lawley and Maxwell [1971] for a convenient 
summary. ) 

For the oblique rotation algorithms used in practice, it is difficult 

to find dh and dg or, equivalent3.y, the partial derivatives 

and ^g /hoc. directly. As in the orthogonal case [Archer 85 Jermrich, 
uv' js 

1975] we proceed by means of implicit differentiation. Suppose that 

(5) 4^(A,0) = 0 

is a k(k - 1) dimensional constraint which is satisifed whenever A = AT 
is an h -rotation of A and 0 = (T'T)""^ is the corresponding 
factor correlation matrix. Constraints of this fonn will be derived in 
Section While it would be possible to express these constraints in 
terms of A and T rather than in terms of A and 0 , the latter 
parameters appear to be the more natural and lead to simpler formulas. 
It is not possible here, as it was in the orthogonal case, to express the 
constraints in terms of A alone. In the development of the general 
theory, this represents the primary difference from the orthogonal case# 
Differentiating the relations: 

(6) A = AT 

(7) a> = (T»T)'^ 



(8) diag $ = I 

(9) ^^(A,*) = 0 
gives : 

(10) dA = dAT + AdT 

(11) d* = Hl)(T'dT + dT'T)$ 

(12) diag d$ = 0 

(15) d\lr^(d/0 + d^^((l>) = 0 

when d\ir^ and d^^ denote the differentials at (A,$) of \|r vith respect 
to its first and second arguments respectively. Solving (lO) through (l5) for 
in tenns of dA will define the differential dh and the required partial 
derivatives of h • Similarly solving for d* in terms of dA will 
define dg and the required partial derivatives of g • This is the 
standard technique of implicit differentiation. The only novelty here 
is that some matrix algebra is employed. Let 

(Ik) K = T'""^dT$ so dT = TK*""^ . 

Using this change of variable, the linearity of d\|r^ , and simplifying 
slightly, equations (lO) through (l5) become 

(15) dA = dAT + AK*'""" 



(16) = -(K + K«) 

(17) diag K = 0 

(18) d\|r^(dAT) + (i\|r^(AKo'"^) - d^^(¥i + K«) = 0 • 

We shall solve the last equation for K in terms of dA . To this end, 
let be the space of k by k matrices which are zero on the diagonal 
and define the linear transfontiation 

(19) ' L(X) = d'^jAX^""^) - dilTgCX + X«) 

for all X in Because of (l?), K is in . Assuming L is an 

invertible linear transformation, {18) gives 

(20) K = -L"-^(d\|r^(dAT)) 
Substituting this expression into (l5) gives 

(21) dA = dAT - AL"-^(d\lf^(dAT))<D"-^ 

which expresses dA in terms of dA and defines the differential dh . 
Similarly substituting (20) into (16) gives 

(22) d* = -L'^(d\|r^(dAT)) - (L"^(dilr^(dAT)))« 

and this defines dg . Equations (2l) and (22) represent our basic 
results • The approach in the following sections will be to find constraint 
functions \|r suitable for various types of oblique rotation and then to 
recover the required partial derivatives of h and g from (2l) and (22) 
respectively. 



-6- 



3* Constraints for an Oblique Rotation Algorithm 

Oblique rotation may be applied directly to the loadings [Jennrich 
& Sampson, I966] or indirectly to the reference factor structure [Carroll, 
i960]. This gives rise to direct and indirect oblique rotation methods 
(see Harman [1967, p. 55^^] for a discussion). Because the present trend 
seems to be toward direct methods, we shall consider only that case. No 
corresponding decision was required when considering orthogonal rotation 
for there the loadings and reference structure are the same. 

Direct oblique analytic rotation algorithms are designed to optimize a 
criterion of the form: 

(25) Q - Q(A) = Q(AT) 

over all T which satisfy (?) and (8). The resulting A = AT is called 
a Q -rotation of A . It is not necessary that the optimizing A be 
unique. It is assumed only that the corresponding rotation algorithm 
h maps each A into an optimizing A . The criterion Q is usually 
a quartic function of the components of A but it may, as in the case of 

target rotation, be a quadratic function. Since there is no need to : 

> 

specialize at this point, an arbitrary Q will be considered here end \ 

in the next section. \ 

Let dQ. denote the differential of Q at A and assume that for a 

given A , A = AT optimizes Q for all T satisfying (7) a^irl (8). \ 

Then it is necessary that \ 

i 

\ 
1 



{2k) dQ(AdT) = 0 

for all dT satisfying (ll) and (12)* But using the change of variable 
in (Ik), this is equivalent to requiring that 

(25) dQ(AK(^^^) = 0 

for all K ±n ^ . Let K(u,v) be the elementary k by k matrix which 
has a one in row u and column v and is zero elsewhere. Then (25) is 
equivalent to requiring 

(26) = dQ(AK(u,v)0'^) = 0 

for 1 < u ^ V < k . These are consti^ints of the form (5) required in 
Section 2* In coordinate form the constraint functions are: 



P k 

i=l r=l ir 



(27) = S S ^ A. (t)^ 

^ ^uv . ^ ^ lu 



for l<u^v<k. In matrix form the constraints require that 



(28) A« If be symmetric. 



Here dQ./dA denotes the matrix {^Q,/^\^) of partial derivatives of Q 
Except for the appearance of ^^'^ these are the same as the constraints 
which arose in the orthogonal case [Archer & Jennrich, 1973]* 
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k* The Partial Derivatives of the Functions h and g 

We shall derive here explicit and computationally convenient formulas 
for the partial derivatives of h and g . To this end let j(j,s) be 
the elementary p by k matrix which has a one in row and column s 
and is zero elsewhere. In terms of their differentials dh and dg , 
the partial derivatives of g and h are: 



ah. 

(29) cih.^(j(j,s)) 
and 



for 1 < i, j < p and 1 < r, s,u,v < k . Another way to express (29) is 
to assert that 



(51) 



H = 



ir 



OS] 



is the matrix of dh relative to the basis 



(52) J(j,s) , l<j<P , l<s<k 

of the space of all p by k matrices* Similarly 



(55) 



G 



15^ 



uv 



is the matrix of dg relative to this basis. The differential dh is 
defined by the fundamental relation. (2l). We may view (2l) as a^ composition 
of four linear transformations and express the matrix H of the linear 
transformation dh in the form 

(3^) H = B - CD'-^ 

where B , C , D , and E are the matrices of the linear transformations 

(35) XH*XT , X e <^ 

(36) ZH»>AZ$'^ , Z e <^ 

(37) ZH* L(z) , z € 

(58) Xh*d^i(XT) , X€<??7 

respectively relative to the basis (5^) of ^ and the basis 

(59) K(u,v) , 1 < u ^ V < k 

of ^ . Similarly since dg is defined by the fundamental relation (22), 
the matrix G of the linear transformation dg has the form 

(^0) G = -(F 4 F«) 

where 

(^1) F = D^^ . 

From (55) the matrix B is given by: 

(^-) B. . = (j(j,s)T). = .T 

ir,js ^ ^ ^ir Ij sr 

where ( • )^^ denotes the element in row i and column r of the 
matrix inside the parentheses and 0. . denotes the Kionecker delta. 
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Similarly from (56) the matrix C is given by: 



Since D is the matrix of the linear transformation L defined by (19) 
it is given by: 

-1^ 



V,xy ^ t^M^^'^^y)^^^ " dilTgCKCx.y) + K(y,x))] 



uv 



f V \ uv "^uv 

1=1 r=l ir xy yx 



Finally using (58), E is given by: 



xy 



- dA. sr 
r=l or 



In each case the index ranges are l<i^J<P^ ^5. ^^^^^^ 
l<u/v<k,and l<x^y<?c. 

Equations (kk) and (k^) require partial deri-^^atives of the constraint 
functions \|r^^ • As discussed in the previous section these depend on the 
rotation algorithm employed. Specific formulas for the generalized Crawford - 
Ferguson family will bt obtained in the next section. 
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^. The Generalized Crawford -Ferfcuson Family 

There are two well-known families of fourth degree oblique rotation 
criteria, the oblimin criteria [Harman, 196?, P* 52^] 



^ P .2 ,2 7 P ,2 P .2 



cJs 1=1 1=1 1=1 



and the Crawford -Ferguson [1970] criteria: 



Tfs 1=1 ifO r=l 



Specific criteria in these families are obtained by fixing the values of 

7 and K and . In spite of the fact that the Crawford-Ferguson 

' 1 ^ - 

family contains an additional parameter, it does not contain the oblimin 
family. Both families, however, are contained in what we shall call the 
generalized Crawford-Ferguson family: 

^ i=l r=l i=l r=l ^ r=l i=l 

P ^ h 
+ Ki S 2 A. 

1=1 r=l 

This family becomes the Crawford-Ferguson family when = + Kg , 
^2 ~ "^2 ^ ^3 " "^1 ' '^ij ~ ^ becomes the oblimin family 

when = -7/p , Kg = 1 , = 7/P , and k^^ = -1 • Indeed, every fourth 
degree polynomial criterion which has the property that it is invariant 
under changes in sign of the rows and columns of A and under row and 
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column permutations is of the form given in (^8). We will derive our 
results for this complete family of quartlc criteria. Let kQ, equal the 
expression given in (48). It follows easily that 



ir 



where 

(50) M. = K,S.S A? + kj: A?^ + ^h^^y^ *^L^?T* 
^ ir 1 i,r ir 2 r ir ^ i ir ^ ±v 

for 1 < i < P and 1 < r < k . Thus using (27), 

P ^ vr / 

(51) \|f = S S A. A. M. 0 , 1 < u / V < k 
' uv . _ _ lu ir ir ' «. / — 

1=1 r=l 

are the appropriate constraint functions for the generalized Cra^vford- 
Ferguson family. One may show easily that 



and tnen with a little more straightforward differentiation effort that 

(55) .M. A. 

^•^-^^ dA. ur d/V ^iv ir lu 
xr 



+ 2k,A. (A«M)"-^) + 2k^A^ a. (A*"-^). 
1 ir^ ^uv 2 ir lu^ 'iv 
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for l<uy?v<k, l^ij^P^ and 1 < r < k . Having completed this 
task it is quite easy to show that 

0(t> ^ ax vy dA 'uu 

xy 

for 1 < a yf V < k and 1 £ x y < k • These are the partial derivatives 
of ^ required in Section h . 

6* Example and Discussion 

As in previous work on the orthogonal case, the results derived here 
apply equally well to principal components analysis and to maximum likeli- 
hood factor analysis. Because of the work of Jflreskog [1967], Jennrich 
and Robinson [1969], and Clarke [197O] maximum likelihood factor analysis 
has become computationally feasible* While this may enhance the popularity 
of maximum likelihood factor analysis, principal components analysis is 
too important and certainly too popular to be ignored* 

We shall consider a maximum likelihood example with direct quartimin 
rotation. Table 1 contains unrotated loading estimates obtained by 
'Jtireskog [1967] from an analysis of the correlations of 9 variables 
measured on 211 subjects* These data were originally analyzed by Emmett 



Insert Table 1 about here 



[19^9]* It was chosen here because Lawley and Maxv;e]l [1971> P* 6U ] 
have given standard errors for the loadings in Table 1 using Lawley' s 
[1967] formulas* We have verified these results which indicates that in 
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all probability Lawley's formulas have been properly implemented both here 
and by Lawley and Maxwell. Since these formulas are not simple, this is 
worth verifying. The verification was exact to within one digit in the 
last decimal place presented in Lawley and Maxwell's text. 

Table 2 contains a direct quartimin rotation of the loadings in 
Table 1 together with the matrix T which produced the transformation. 
This is a fairly clean rotation. Variables 5, and 6 appear to be 
primarily associated with factor 2, variable 8 with factor 5, and the 
remaining with factor 1. But how stable are the factor loading estimates? 
Table 5 contains standard errors computed from the formulas derived here 
together with those of Lawley.* The standard errors are pleasantly, almost 
surprisingly, small. They range from .056 to .l^il. This is similar to 
what happened in the orthogonal case [Archer & Jennrich, 1975] using a 
different set of data but one with roughly the same number of variables 
and sample size. To our knowledge this is the first time that standard 
errors for obliquely rotated factor loading estimates have been published. 



Insert Tables 2 and 5 about here 



The standard errors presented give a quick indication of the stability 
of the rotated loading estimates. For example one may use the standard 
errors in Table 5 to scan Table 2 for loadings which are significantly 
different from zero while accounting for the fact that he is scanning. 
To this end we observe that asymptotically at least 

*There has been a recent indication that the Lawley [1967] formulas 
may have a minor error. This does not affect the theory developed here, but 
it could affect the values in this example. 
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(55) 



1 ^ 

' ir 
std A. 




< X Q.CkCp - k + 1)) 



1 < i < P . 



1 < r < k 



ir 



is a 95^ simultaneous confidence interval for all pk loadings • Here 
X Qc(v) denotes the square root of the 95^ probability point of the chi- 
squared distribution with v degrees of freedom. In our example 
V = k(p - k + l) = 21 so 



The entries in Table 2 which after division by the corresponding entries 
in Table 3 exceed this value have been marked with an asterisk* They 
are the estimates which are significantly different from zero at 
the ^% probability level. It is interesting to note that since the median 
standard error in Table 3 is •O67, the median significant deviation from 
zero is .383* This is not too far from the value .30 suggested by the "rule 
of thirty" and often found in factor analytic studies. As can be seen by 
multiplying each entry in Table 3 by 5*72 the rule of thirty represents 
a fairly gross approximation but not one which is completely wide of the 
mark* Standard errors depend, of course, on sample size. For this reason 
the rule of thirty is often used with the understanding that, as here, there 
are about ten observations per loading. 



(56) 



X (21) = 732?f = 5.72 
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TABUE 1 

Unrotated Standardized Maximum Likelihood Loadings 



Factor 

Variate I II III Commaoality 



1 


.66k 


.321 


.07!^ 


.550 


2 


.689 


.2l^7 


-.193 


.573 


3 


'h93 


.302 


-.222 


.383 


k 


.837 


-.292 


-.055 


.788 


5 


.705 


-.315 


-.153 


.619 


6 


.819 


-.377 


.105 


.823 


7 


.661 


.396 


-.078 


,600 


8 


.1^58 


.296 


.1^91 


.558 


9 


.766 


.1^27 


-.012 


.769 



TABLE 2 

Direct Quartimin Rotation of the Loadings in Table 1 



Factor 



Variate I II III Commonality 



1 


.596* 


.081^ 


.191 


.550 


2 


.705* 


.151 


-.101^ 


.573 


3 


.690* 


-.056 


-.ll«0 


.585 


k 


.102 


.827* 


-.025 ■ 


.788 


5 


.095 


.71^9* 


-.165 


.619 


6 


.087 


.952* 


.109 


.825 


7 


.772* 


-.025 


.01^5 


.600 


8 


.201^ 


.057 


.611 


.558 


9 


.811* 


.019 


.127 


.769 



Transformation Matrix T 



.1^61 

1.055 
-.650 



.625 
-1.069 
.180 



.081 
.182 
1.059 
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TABLS 5 

Standard Errors for the Rotated Loadings in Table 2 



Factor 



Variate 


I 


II 


III 


1 


.07l< 


.082 


'.096 


2 


.069 


.082 


.075 


5 


.072 


.055 


.067 


k 


.051^ 


.081^ 


.075 


5 


.065 


.069 


.oin 


6 


.01^6 


.058 


.056 


7 


.061^ 


.056 


• lljl 


8 


.06h 


.081 


.116 


9 


.Oh6 


.050 


.059 



